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We combine results from CDF and D0 on direct searches for a standard model (SM) Higgs boson 
{H) in pp collisions at the Fermilab Tevatron at -y/s = 1.96 TeV. Compared to the previous Higgs 
Tevatron combination, more data and new channels {WH —> rybb, VH — > Trbb/jjTT, VH 
jjbb, ttH ttbb) have been added. Most previously used channels have been reanalyzed to gain 
sensitivity. We use the latest parton distribution functions and gg ^ H theoretical cross sections 
when comparing our limits to the SM predictions. With 2.0-3.6 fb^^ of data analyzed at CDF, and 
0.9-4.2 fb"^ at D0, the 95% C.L. upper limits on Higgs boson production are a factor of 2.5 (0.86) 
times the SM cross section for a Higgs boson mass of mn =115 (165) GeY/c^. Based on simulation, 
the corresponding median expected upper limits are 2.4 (1.1). The mass range excluded at 95% 
C.L. for a SM Higgs has been extended to 160 < mn < 170 GeV/c^. 



I. INTRODUCTION 

The search for a mechanism for electroweak symme- 
try breaking, and in particular for a standard model 
(SM) Higgs boson has been a major goal of parti- 
cle physics for many years, and is a central part of 
the Fermilab Tevatron physics program. Both the 
CDF and D0 experiments are reporting new combi- 
nations [21 [3] of multiple direct searches for the SM 
Higgs boson. The new searches include more data 
and improved analysis techniques compared to previ- 
ous analyses. The sensitivities of these new combina- 
tions significantly exceed previous work jH [5]. The 
most recent Tevatron Higgs combination fB^ only in- 
cluded channels seeking Higgs bosons of masses be- 
tween 155 and 200 GeV/c^, and the most recent com- 
bination over the entire mass range 100-200 GeV/c^ 
was reported in April 2008 [7\. 

In this note, we combine the most recent results of 
all such searches in pp collisions at ^/s = 1.96 TeV. 
The analyses combined here seek signals of Higgs 
bosons produced in associated with vector bosons 
{qq W/ZH), through gluon-gluon fusion {gg H), 
and through vector boson fusion (VBF) {qq q'q'H) 
corresponding to integrated luminosities ranging from 
2.0-3.6 fb-i at CDF and 0.9-4.2 fb^^ at D0. The 
Higgs boson decay modes studied are H ^ bb, H ^ 
W+W-, H T+T- and H -> 77. 

To simplify the combination, the searches are sep- 
arated into 75 mutually exclusive final states (23 for 
CDF and 52 for D0; see Table |l] and |ll| referred to 
as "analyses" in this note. The selection procedures 
for each analysis are detailed in Refs. [8] through [21] , 
and are briefly described below. 



TABLE I: Luminosity, explored mass range and references 
for the different processes and final state (I — e, fi) for the 
CDF analyses 



Channel 


Lumi. 


ruH range 


Ref. 




(fb-^) 


(GeV/c^) 




WH ivbb 


2.7 


100-150 


m 


ZH vvbb 


2.1 


105-150 


m 


ZH ^ e+rbb 


2.7 


100-150 


m 


H W+W" 


3.6 


110-200 


m 


WH WW+W~ l^vi^v 


3.6 


110-200 


m 


H + X t+t' + 2 jets 


2.0 


110-150 


|12| 


WH + ZH ^ jjbb 


2.0 


100-150 


|13| 



TABLE II: Luminosity, explored mass range and refer- 
ences for the different processes and final state {£ = e, fi) 
for the D0 analyses 



Channel 


Lumi. 


ran range 


Ref 




(fb-i) 


(GeV/c^) 




WH Ivbb 


2.7 


100-150 


|14| 


WH Tvbb 


0.9 


105-145 


|15| 


VH rrbb / qqrr 


1.0 


105-145 


|15| 


ZH -* wbb 


2.1 


105-145 


|16| 


ZH l+rbb 


2.3 


105-145 


|17| 


WH WW+W- l^vl^v 


1.1 


120-200 


|18| 


H W+W- l^vt^v 


3.0-4.2 


115-200 


|19| 


H 77 


4.2 


100-150 


|20| 


ttH ttbb 


2.1 


105-145 





II. ACCEPTANCE, BACKGROUNDS AND 
LUMINOSITY 

Event selections are similar for the corresponding 
CDF and D0 analyses. For the case of WH Mb, 
an isolated lepton {£ = electron or muon) and two jets 



are required, with one or more 6-tagged jet, i.e., iden- 
tified as containing a weakly-decaying B hadron. Se- 
lected events must also display a significant imbalance 
in transverse momentum (referred to as missing trans- 
verse energy or I^t)- Events with more than one iso- 
lated lepton are vetoed. For the D0 WH Ivbb anal- 
yses, two and three jet events are analyzed separately. 
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and in each of these samples two non-overlapping b- 
tagged samples are defined, one being a single "tight" 
&-tag (ST) sample, and the other a double "loose" 
&-tag (DT) sample. The tight and loose 6-tagging cri- 
teria are defined with respect to the mis-identification 
rate that the 6-tagging algorithm yields for light quark 
or gluon jets ("mistag rate") typically < 0.5% or 
< 1.5%, respectively. The final variable is a neural 
network output which takes as input seven kinemat- 
ics variables and a matrix element discriminant for 
the 2 jet sample, while for the 3 jet sample the dijet 
invariant mass is used. In this combination, we add a 
new analysis WH — s- riybb in which the r is identified 
through its hadronic decays, this analysis is sensitive 
to ZH — > rlprphh as well in those cases where a r fails 
to be identified. The analysis is carried out according 
to the type of reconstructed r and is also separated 
into two and three jets with DT events only. It uses 
the dijet invariant mass of the bb system as discrimi- 
nant variable. 

For the CDF WH ivbb analyses, the events are 
grouped into six categories. In addition to the se- 
lections requiring an identified lepton, events with an 
isolated track failing lepton selection requirements are 
grouped into their own categories. This provides some 
acceptance for single prong tau decays. Within the 
lepton categories there are three ^-tagging categories 
- two tight 6-tags (TDT), one tight &-tag and one loose 
&-tag (LDT), and a single, tight, 6-tag (ST). In each 
category, two discriminants are calculated for each 
event. One neural network discriminant is trained at 
each niH in the test range, separately for each cat- 
egory. A second discriminant is a boosted decision 
tree, featuring not only event kinematic and 6-tagging 
observables, but matrix element discriminants as well. 
These two discriminants are then combined together 
using an evolutionary neural network ^22i to form a 
single discriminant with optimal performance. 

For the ZH — > vvbb analyses, the selection is sim- 
ilar to the WH selection, except all events with iso- 
lated leptons are vetoed and stronger multijet back- 
ground suppression techniques are applied. Both CDF 
and D0 analyses use a track-based missing transverse 
momentum calculation as a discriminant against false 
frp. There is a sizable fraction of WH ii/bb sig- 
nal in which the lepton is undetected, that is selected 
in the ZH vvbb samples, so these analyses are 
also refered to as VH l^j^bb. The CDF analysis 
uses three non-overlapping samples of events (TDT, 
LDT and ST as for WH) while D0 uses a sample of 
events having one tight 5-tag jet and one loose &-tag 
jet. CDF used neural-network discriminants as the fi- 
nal variables, while D0 uses boosted decision trees as 
advanced analysis technique. 

The ZH — > l^l^hb analyses require two iso- 
lated leptons and at least two jets. They use non- 
overlapping samples of events with one tight 6-tag 
and two loose 5-tags. For the D0 analysis neural- 



network and boosted decision trees discriminants are 
the final variables for setting limits (depending on 
the sub-channel), while CDF uses the output of a 2- 
dimensional neural-network. CDF corrects jet ener- 
gies for Ipj' using a neural network approach. In this 
analysis also the events are divided into three tagging 
categories: tight double tags, loose double tags, and 
single tags. 

For the H W^W^ analyses, signal events are 
characterized by a large Ip^ and two opposite-signed, 
isolated leptons. The presence of neutrinos in the fi- 
nal state prevents the reconstruction of the candidate 
Higgs boson mass. D0 selects events containing elec- 
trons and muons, dividing the data sample into three 
final states: e+e^, e^/x^, and CDF separates 

the H W^W~ events in five non-overlapping sam- 
ples, labeled "high s/5" and "low s/6" for the lepton 
selection categories, and also split by the number of 
jets: 0, 1, or 2-\- jets. The sample with two or more 
jets is not split into low s/b and high s/b lepton cate- 
gories. The division of events into jet categories allows 
the analysis discriminants to separate three different 
categories of signals from the backgrounds more effec- 
tively. The signal production mechanisms considered 
are gg ^ H ^W+W' , WH+ZH jjW+W', and 
the vector-boson fusion process. The final discrimi- 
nants are neural-network outputs for D0 and neural- 
network output including likelihoods constructed from 
matrix-element probabilities (ME) as input to the 
neural network, for CDF, in the 0-jet bin, else the ME 
are not used. All analyses in this channel have been 
updated with more data and analysis improvements. 

The CDF collaboration also contributes an analy- 
sis searching for Higgs bosons decaying to a tau lep- 
ton pair, in three separate production channels:direct 
pp ^ H production, associated WH or ZH pro- 
duction, or vector boson production with H and for- 
ward jets in the final state. In this analysis, the final 
variable for setting limits is a combination of several 
neural-network discriminants. 

D0 also contributes a new analysis for the final 
state TT jet jet, which is sensitive to the VH —^ jjrr, 
ZH TTbb, VBF and gluon gluon fusion (with two 
additional jets) mechanisms. It uses a neural network 
output as discriminant variable. 

The CDF collaboration introduces a new all- 
hadronic channel, WH + ZH jjbb for this com- 
bination. Events are selected with four jets, at least 
two of which are 6-tagged with the tight 6-tagger. The 
large QCD backgrounds are estimated with the use of 
data control samples, and the final variable is a matrix 
element signal probability discriminant. 

The D0 collaboration contributes three WH 
WW^W~ analyses, where the associated W boson 
and the W boson from the Higgs boson decay which 
has the same charge are required to decay leptonically, 
thereby defining three like-sign dilepton final states 
(e^e*, e^/i*, and ^^fi^) containing all decays of the 
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third W boson. In this analysis, which has not been 
updated for this combination, the final variable is a 
likelihood discriminant formed from several topologi- 
cal variables. CDF contributes a WH WW+W' 
analysis using a selection of like-sign dileptons and a 
neural network to further purify the signal. D0 also 
contributes an analysis searching for direct Higgs bo- 
son production decaying to a photon pair in 4.2 fb~^ 
of data. In this analysis, the final variable is the in- 
variant mass of the two-photon system. 

Another new search from D0 is included in this 
combination, namely the search for ttH — > ttbb. Here 
the samples are analyzed independently according to 
the number of 6-tagged jets (1,2,3, i.e. ST,DT,TT) 
and the total number of jets (4 or 5) . The total trans- 
verse energy of the reconstructed objects (H-r) is used 
as discriminant variable. 

All Higgs boson signals are simulated using 
PYTHIA j23], and CTEQ5L or CTEQ6L [24J 
leading-order (LO) parton distribution functions. The 
gg H production cross section is calculated at 
NNLL in QCD and also includes two- loop electroweak 
effects; see Refs. [25l[26] and references therein for the 
different steps of these calculations. The newer calcu- 
lation includes a more thorough treatment of higher- 
order radiative corrections, particularly those involv- 
ing b quark loops. The gg —t H production cross 
section depends strongly on the PDF set chosen and 
the accompanying value of as . The cross sections used 
here are calculated with the MSTW 2008 NNLO PDF 
set [30]. The new gg ^ H cross sections supersede 
those used in the update of Summer 2008 [51 [571 US] > 
which had a simpler treatment of radiative correc- 
tions and used the older MRST 2002 PDF set [H]- 
The Higgs boson production cross sections used here 
are listed in (originally from [25^). We include all 
significant Higgs production modes in the high mass 
search: besides gluon-gluon fusion through a virtual 
top quark loop (ggH), we include production in asso- 
ciation with a.W ov Z vector boson (VH) [HI 131132], 
and vector boson fusion (VBF) (251 IMj. 

The Higgs boson decay branching ratio predictions 
are calculated with HDECAY [35 . For both CDF and 
D0, events from multijet (instrumental) backgrounds 
("QCD production") are measured in data with differ- 
ent methods, in orthogonal samples. For CDF, back- 
grounds from other SM processes were generated us- 
ing PYTHIA, ALPGEN [36;, MC@NLO [37] and 
HERWIG [38 programs. For D0, these backgrounds 
were generated using PYTHIA, ALPGEN, and 
COMPHEP [39,, with PYTHIA providing parton- 
showering and hadronization for all the generators. 
These background processes were normalized using ei- 
ther experimental data or next-to-leading order calcu- 
lations (from MCFM [40^ for W+ heavy flavor pro- 
cess). 

Integrated luminosities, and references to the col- 
laborations' public documentation for each analysis 



are given in Table |l] for CDF and in Table [H] for D0. 
The tables include the ranges of Higgs boson mass 
(m//) over which the searches were performed. 



III. DISTRIBUTIONS OF CANDIDATES 



The number of channels combined is quite large, 
and the number of bins in each channel is large. 
Therefore, the task of assembling histograms and 
checking whether the expected and observed limits 
are consistent with the input predictions and observed 
data is difficult. We therefore provide histograms that 
aggregate all channels' signal, background, and data 
together. In order to preserve most of the sensitivity 
gain that is achieved by the analyses by binning the 
data instead of collecting them all together and count- 
ing, we aggregate the data and predictions in narrow 
bins of signal-to-background ratio, s/b. Data with 
similar s jb may be added together with no loss in sen- 
sitivity, assuming similar systematic errors on the pre- 
dictions. The aggregate histograms do not show the 
effects of systematic uncertainties, but instead com- 
pare the data with the central predictions supplied by 
each analysis. 

The range of sjb is quite large in each analysis, 
and so log]^Q(s/&) is chosen as the plotting variable. 
Plots of the distributions of log]^Q(s/6) are shown for 
ran = 115 and 165 GeV/c^ in Figure [l] These dis- 
tributions can be integrated from the high-s/fe side 
downwards, showing the sums of signal, background, 
and data for the most pure portions of the selection 
of all channels added together. These integrals can be 
seen in Figure |2] 



m,[=llSGeV/c^ 


" Tevatron Data 
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logj„(s/b) 



logj„(s/b) 



FIG. 1: Distributions of logifl(s/6), for the data from 
all contributing channels from CDF and D0, for Higgs 
boson masses of 115 and 165 GeV/c^ The data are shown 
with points, and the signal is shown stacked on top of the 
backgrounds. Underflows and overflows are collected into 
the bottom and top bins. 
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Integrated Expected Signal Integrated Expected Signal 



FIG. 2: Integrated distributions of s/b, starting at 
the high s/b side. The total signal+background and 
background-only integrals are shown separately, along 
with the data sums. Data are only shown for bins that 
have data events in them. 

IV. COMBINING CHANNELS 

To gain confidence that the final result does not de- 
pend on the details of the statistical formulation, we 
perform two types of combinations, using the Bayesian 
and Modified Frequentist approaches, which give sim- 
ilar results (within 10%). Both methods rely on distri- 
butions in the final discriminants, and not just on their 
single integrated values. Systematic uncertainties en- 
ter as uncertainties on the expected number of signal 
and background events, as well as on the distribution 
of the discriminants in each analysis ( "shape uncer- 
tainties"). Both methods use likelihood calculations 
based on Poisson probabilities. Detailed descriptions 
of the techniques can be found in [1] as well as in |2] 
for the Bayesian Approach and in |3J STJ for the 
Modified Frequentist Approach. 



A. Systematic Uncertainties 

Systematic uncertainties differ between experiments 
and analyses, and they affect the rates and shapes 
of the predicted signal and background in correlated 
ways. The combined results incorporate the sensitiv- 
ity of predictions to values of nuisance parameters, 
and include correlations, between rates and shapes, 
between signals and backgrounds, and between chan- 
nels within experiments and between experiments. 
More on these issues can be found in the individual 
analysis notes [Sj through [5T] and in the combination 
note [1]. 

V. COMBINED RESULTS 

Using the combination procedures outlined in [T], 
we extract limits on SM Higgs boson production 
a X B{H X) in pp coUisions at ^/s — 1.96 TcV for 
niH = 100 — 200 GeV/c^. To facilitate comparisons 



with the standard model and to accommodate anal- 
yses with different degrees of sensitivity, we present 
our results in terms of the ratio of obtained limits to 
cross section in the SM, as a function of Higgs boson 
mass, for test masses for which both experiments have 
performed dedicated searches in different channels. A 
value of the combined limit ratio which is less than or 
equal to one would indicate that that particular Higgs 
boson mass is excluded at the 95% C.L. 

The combinations of results of each single experi- 
ment, as used in this Tevatron combination, yield the 
following ratios of 95% C.L. observed (expected) lim- 
its to the SM cross section: 3.6 (3.2) for CDF and 
3.7 (3.9) for D0 at niH = 115 GeV/c^, and 1.5 (1.6) 
for CDF and 1.3 (1.8) for D0 at mn = 165 GeV/c^. 

The ratios of the 95% C.L. expected and observed 
limit to the SM cross section are shown in Figure [3] for 
the combined CDF and D0 analyses. Tables with the 
observed and median expected ratios as obtained by 
the Bayesian and the CLs methods are listed in [T]. 
In the following summary we quote only the limits 
obtained with the Bayesian method since they are 
slightly more conservative (based on the expected lim- 
its) for the quoted values, but all the equivalent num- 
bers for the CLs method can be retrieved from [T] . We 
obtain the observed (expected) values of 2.5 (2.4) at 
niH = 115 GeV/c^, 0.99 (1.1) at wih = 160 GeV/c^, 
0.86 (1.1) at ruH = 165 GeV/c^, and 0.99 (1.4) at 
TTiH = 170 GeV/c2. We exclude at the 95% C.L. 
the production of a standard model Higgs boson with 
mass between 160 and 170 GeV/c^. This result is 
obtained with both Bayesian and CLs calculations. 

We also show in Figure |4] the I-CL5 distribution 
as a function of the Higgs boson mass, at high mass 
(> 150 GeV/c^) which is directly interpreted as the 
level of exclusion at 95% C.L. of our search. Note 
that this figure is obtained using the CLs method. 
The 90% C.L. line is also shown on the figure. We 
provide the Log-likelihood ratio (LLR) values for our 
combined Higgs boson search, as obtained using the 
CLs method in [1 . 

In summary, we have combined all available CDF 
and D0 results on SM Higgs search, based on lumi- 
nosities ranging from 0.9 to 4.2 fb^^. Compared to our 
previous combination, new channels have been added 
and most previously used channels have been rean- 
alyzed to gain sensitivity. We use the latest parton 
distribution functions and gg H theoretical cross 
sections when comparing our limits to the SM predic- 
tions at high mass. 

The 95% C.L. upper limits on Higgs boson 
production are a factor of 2.5 (0.86) times the 
SM cross section for a Higgs boson mass of 
niH =115 (165) GeV/c^. Based on simulation, the 
corresponding median expected upper limits are 2.4 
(1.1). Standard Model branching ratios, calculated 
as functions of the Higgs boson mass, are assumed. 
These results extend significantly the individual limits 
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of each collaboration and our previous combination. 
The mass range excluded at 95% C.L. for a SM Higgs 
has been extended to 160 < rrin < 170 GeV/c^. The 
sensitivity of our combined search is expected to grow 
substantially in the near future with the additional lu- 
minosity already recorded at the Tevatron and not yet 
analyzed, and with additional improvements of our 
analysis techniques which will be propagated in the 
current and future analyses. 



FIG. 3: Observed and expected (median, for the 
background-only hypothesis) 95% C.L. upper limits on the 
ratios to the SM cross section, as functions of the Higgs 
boson mass for the combined CDF and D0 analyses. The 
limits are expressed as a multiple of the SM prediction for 
test masses (every 5 GeV/c^) for which both experiments 
have performed dedicated searches in different channels. 
The points are joined by straight lines for better read- 
ability. The bands indicate the 68% and 95% probability 
regions where the limits can fluctuate, in the absence of 
signal. The limits displayed in this figure are obtained 
with the Bayesian calculation. 
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FIG. 4: Distributions of 1-CLs as a function of the Higgs 
boson mass (in steps of 5 GeY/c^), as obtained with CLs 
method, for the combination of the CDF and D0 analyses. 
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